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Abstract 

We consider the following "multiway cut packing" problem in undirected graphs: we are given a 
graph G = {V, E) and k commodities, each corresponding to a set of terminals located at different 
vertices in the graph; our goal is to produce a collection of cuts , • • ■ ^Ek) such that Ei is a multiway 
cut for commodity i and the maximum load on any edge is minimized. The load on an edge is defined 
to be the number of cuts in the solution crossing the edge. In the capacitated version of the problem 
edges have capacities and the goal is to minimize the maximum relative load on any edge - the ratio 
of the edge's load to its capacity. We present the first constant factor approximations for this problem in 
arbitrary undirected graphs. The multiway cut packing problem arises in the context of graph labeling 
problems where we are given a partial labeling of a set of items and a neighborhood structure over them, 
and, informally stated, the goal is to complete the labeling in the most consistent way. This problem 
was introduced by Rabani, Schulman, and Swamy (SODA'08), who developed an 0(logn/loglogn) 
approximation for it in general graphs, as well as an improved 0(log^ k) approximation in trees. Here n 
is the number of nodes in the graph. 

We present an LP-based algorithm for the multiway cut packing problem in general graphs that guar- 
antees a maximum edge load of at most 80PT + 4. Our rounding approach is based on the observation 
that every instance of the problem admits a laminar solution (that is, no pair of cuts in the solution 
crosses) that is near-optimal. For the special case where each commodity has only two terminals and 
all commodities share a common sink (the "common sink s-t cut packing" problem) we guarantee a 
maximum load of OPT + 2. Both of these variants are NP-hard; for the common-sink case our result is 
nearly optimal. 



*The conference version of this paper is to appear at SODA 2009. This is the full version. 
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1 Introduction 



We study the multiway cut packing problem (MCP) introduced by Rabani, Schulman and Swamy 191 . In this 
problem, we are given k instances of the multiway cut problem in a common graph, each instance being a set 
of terminals at different locations in the graph. Informally, our goal is to compute nearly-disjoint multiway 
cuts for each of the instances. More precisely, we aim to minimize the maximum number of cuts that any 
single edge in the graph belongs to. In the weighted version of this problem, different edges have different 
capacities; the goal is to minimize the maximum relative load of any edge, where the relative load of an edge 
is the ratio of the number of cuts it belongs to and its capacity. 

The multiway cut packing problem belongs to the following class of graph labeling problems. We 
are given a partially labeled set of n items along with a weighted graph over them that encodes similarity 
information among them. An item's label is a string of length k where each coordinate of the string is 
either drawn from an alphabet S, or is undetermined. Roughly speaking, the goal is to complete the partial 
labeling in the most consistent possible way. Note that completing a single specific entry (coordinate) of 
each item label is like finding what we call a "set multiway cut" — for cr € S let S*^ denote the set of nodes 
for which the ith coordinate is labeled a in the partial labeling, then a complete and consistent labeling for 
this coordinate is a partition of the items into parts such that the fi"^ part contains the entire set S*^. The 
cost of the labeling for a single pair of neighboring items in the graph is measured by the Hamming distance 
between the labels assigned to them. The overall cost of the labeling can then be formalized as a certain 
norm of the vector of (weighted) edge costs. 

Different choices of norms for the overall cost give rise to different objectives. Minimizing the ii norm, 
for example, is the same as minimizing the sum of the edge costs. This problem decomposes into finding 
k minimum set multiway cuts. Each set multiway cut instance can be reduced to a minimum multiway cut 
instance by simply merging all the items in the same set into a single node in the graph, and can therefore 
be approximated to within a factor of 1.5 HI. On the other hand, minimizing the £oo norm of edge costs 
(equivalently, the maximum edge cost) becomes the set multiway cut packing problem. Formally, in this 
problem, we are given k set multiway cut instances S^, - ■ ■ ,S^, where each = SI x x ■ ■ ■ x S^^^. The 
goal is to find k cuts, with the ith cut separating every pair of terminals that belong to sets Sj^ and S^^ with 
ji 7^ j2, such that the maximum (weighted) cost of any edge is minimized. When \Sj\ = 1 for all i G [A;] 
and j G S, this is the multiway cut packing problem. 

To our knowledge Rabani et al. [9] were the first to consider the multiway cut packing problem and 
provide approximation algorithms for it. They used a linear programming relaxation of the problem along 
with randomized rounding to obtain an 0{ ^^°f^^ ) approximation, where n is the number of nodes in the 
given graphQ. This approximation ratio arises from an application of the Chernoff bounds to the randomized 
rounding process, and improves to an 0(1) factor when the optimal load is il(logn). When the underlying 
graph is a tree, Rabani et al. use a more careful deterministic rounding technique to obtain an improved 
0(log^ k) approximation. The latter approximation factor holds also for a more general multicut packing 
problem (described in more detail below). One nice property of the latter approximation is that it is inde- 
pendent of the size of the graph, and remains small as the graph grows but k remains fixed. Then, a natural 
open problem related to their work is whether a similar approximation guarantee independent of n can be 

'Rabani et al. claim in their paper that the same approximation ratio holds for the set multiway cut packing problem that arises 
in the context of graph labelings. However their approach of merging nodes with the same attribute values (similar to what we 
described above for minimizing the £i norm of edge costs) does not work in this case. Roughly speaking, if nodes u and v have the 
same ith attribute, and nodes v and w have the same jth attribute, then this approach merges all three nodes, although an optimal 
solution may end up separating u from w in some of the cuts. We are not aware of any other approximation preserving reduction 
between the two problems. 
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obtained even for general graphs. 

Our results & techniques. We answer this question in the positive. We employ the same linear pro- 
gramming relaxation for this problem as Rabani et al., but develop a very different rounding algorithm. In 
order to produce a good integral solution our rounding algorithm requires a fractional collection of cuts 
that is not only feasible for the linear program but also satisfies an additional good property — the cut col- 
lection is laminar. In other words, when interpreted appropriately as subsets of nodes, no two cuts in the 
collection "cross" each other. Given such an input the rounding process only incurs a small additive loss 
in performance — the final (absolute) load on any edge is at most 3 more than the load on that edge of the 
fractional solution that we started out with. Of course the laminarity condition comes at a cost - not ev- 
ery fractional solution to the cut packing LP can be interpreted as a laminar collection of cuts (see, e.g.. 
Figure |9l). We show that for the multiway cut problem any fractional collection of cuts can be converted 
into a laminar one while losing only a multiplicative factor of 8 and an additive o(l) amount in edge loads. 
Therefore, for every edge e we obtain a final edge load of 8£°^^ + 4, where is the optimal load on the 
edge. We only load edges with Ce > 1 and since the optimal cost is at least 1 our algorithm also obtains a 
purely multiplicative 12 approximation. 

Our laminarity based approach proves even more powerful in the special case of common-sink s-t cut 
packing problem or CSCP. In this special case every multiway cut instance has only two terminals and all 
the instances share a common sink t. We use these properties to improve both the rounding and laminarity 
transformation algorithms, and ensure a final load of at most i^^'^+l for every edge e. The CSCP is NP-hard 
(see Section [5]) and so our guarantee for this special case is the best possible. 

In converting a fractional laminar solution to an integral one we use an iterative rounding approach, 
assigning an integral cut at each iteration to an appropriate "innermost" terminal. Throughout the algorithm 
we maintain a partial integral cut collection and a partial fractional one and ensure that these collections 
together are feasible for the given multiway cut instances. As we round cuts, we "shift" or modify other 
fractional cuts so as to maintain bounds on edge loads. Maintaining feasibiUty and edge loads simultane- 
ously turns out to be relatively straightforward in the case of common-sink s-t cut packing - we only need to 
ensure that none of the cuts in the fractional or the integral collection contain the common sink t. However 
in the general case we must ensure that new fractional cuts assigned to any terminal must exclude all other 
terminals of the same multiway cut instance. This requires a more careful reassignment of cuts. 

Related work. Problems falling under the general framework of graph labeling as described above have 
been studied in various guises. The most extensively studied special case, called label extension, involves 
partial labelings in which every item is either completely labeled or not labeled at all. When the objective 
is to minimize the £i norm of edge costs, this becomes a special case of the metric labeling and 0-extension 
problems ||6l|2l|4l|5l. (The main difference between 0-extension and the label extension problem as described 
above is that the cost of the labeling in the former arises from an arbitrary metric over the labels, while in 
the latter it arises from the Hamming metric.) 

When the underlying graph is a tree and edge costs are given by the edit distance between the corre- 
sponding labels, this is known as the tree alignment problem. The tree alignment problem has been studied 
widely in the computational biology literature and arises in the context of labeling phylogenies and evolu- 
tionary trees. This version is also NP-hard, and there are several PTASes known llT3l [T2l [TTI . Ravi and 
Kececioglu ifTOl also introduced and studied the ioo version of this problem, calling it the bottleneck tree 
alignment problem. They presented an O(logn) approximation for this problem. A further special case of 
the label extension problem under the £oq objective, where the underlying tree is a star with labeled leaves, 
is known as the closest string problem. This problem is also NP-hard but admits a PTAS [7]. 

As mentioned above, the multiway cut packing problem was introduced by Rabani, Schulman and 
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Swamy Rabani et al. also studied the more general multicut packing problem (where the goal is to 
pack multicuts so as to minimize the maximum edge load) as well as the label extension problem with the 
ioo objective. Rabani et al. developed an 0(log^ A;) approximation for multicut packing in trees, and an 
0(Iog M ^J°j^g,^ ) in general graphs. Here M is the maximum number of terminals in any one multicut in- 
stance. For the label extension problem they presented a constant factor approximation in trees, which holds 
even when edge costs are given by a fairly general class of metrics over the label set (including Hamming 
distance as well as edit distance). 

Another line of research loosely related to the cut packing problems described here considers the prob- 
lem of finding the largest collection of edge-disjoint cuts (not corresponding to any specific terminals) in 
a given graph. While this problem can be solved exactly in polynomial time in directed graphs i8|, it is 
NP-hard in undirected graphs, and Caprara, Panconesi and Rizzi [3| presented a 2 approximation for it. In 
terms of approximability, this problem is very different from the one we study — in the former, the goal is to 
find as many cuts as possible, such that the load on any edge is at most 1, whereas in our setting, the goal is 
to find cuts for all the commodities, so that the maximum edge load is minimized. 

2 Definitions and results 

Given a graph G = {V,E), a. cut in G is a subset of edges E', the removal of which disconnects the graph 
into multiple connected components. A vertex partition of G is a pair (C, V \C) with C C C y. For 
a set C with C C C y, we use 6{C) to denote the cut defined by C, that is, 5{C) = {(n, v) e E : 
\C n {u, v}\ = 1}. We say that a cut E' C E separates vertices u and vifu and v lie in different connected 
components in {V, E \ E'). The vertex partition defined by set C separates u and v if the two vertices are 
separated by the cut 6{C). Given a collection of cuts £ = {Ei, ■ • • , E^} and capacities Ce on edges, the load 
on an edge e is defined as the number of cuts that contain e, that is, = \{Ei G £\e G Ei}\. Likewise, 
given a collection of vertex partitions C = {Ci, • • • , Cfe}, the load £g on an edge e is defined to be the load 
of the cut collection {6{Ci), • • • , 6{Ck)} on e. 

The input to a multiway cut packing problem (MCP) is a graph G = {V, E) with non-zero integral 
capacities Ce on edges, and k sets 5i, • • • , iSfc of terminals (called "commodities"); each terminal i G Sa 
resides at a vertex in V. The goal is to produce a collection of cuts £ = {Ei, ■ ■ ■ ,Ef^}, such that (1) for 
all a G [k], and for all pairs of terminals i,j € Sa, the cut Ea separates and rj, and (2) the maximum 
"relative load" on any edge, maxg^f /ce, is minimized. 

In a special case of this problem called the common-sink s-t cut packing problem (CSCP), the graph G 
contains a special node t called the sink and each commodity set has exactly two terminals, one of which 
resides at t. Again the goal is to produce a collection of cuts, one for each commodity such that the maximum 
relative edge load is minimized. 

Both of these problems are NP-hard to solve optimally (see Section |5]l, and we present LP-rounding 
based approximation algorithms for them. We assume without loss of generality that the optimal solution 
has a relative load of 1. The integer program [MCP-IPl below encodes the set of solutions to the MCP with 
relative load 1. 

Here Va denotes the set of all paths between any two vertices r^, rj with i, j G Sa, i 7^ J- In order to be 
able to solve this program efficiently, we relax the final constraint to Xa,e £ [0,1] for all a G [A;] and e £ E. 
Although the resulting linear program has an exponential number of constraints, it can be solved efficiently; 
in particular, the polynomial-size program [MCP-LPI below is equivalent to it. Given a feasible solution to 
this linear program, our algorithms round it into a feasible integral solution with small load. 
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da{u, v) < da{u, w) + da{w, v) Vfl £ [k],u,v,w 
^da{e)<Ce yeeE 

a 

da{e)e [0,1] Wae[k],eeE 

(MCP-LP) 



In the remainder of this paper we focus exclusively on solutions to the MCP and CSCP that are collec- 
tions of vertex partitions. This is without loss of generality (up to a factor of 2 in edge loads for the MCP) 
and allows us to exploit structural properties of vertex sets such as laminarity that help in constructing a 
good approximation. Accordingly, in the rest of the paper we use the term "cut" to denote a subset of the 
vertices that defines a vertex partition. 

A pair of cuts Ci,C2 C V is said to "cross" if all of the sets CinC2, Ci\C2, and C2\Ci are non-empty. 
A collection C = {Ci, • • • , Cfc} of cuts is said to be laminar if no pair of cuts Q, Cj £ C crosses. All of 
our algorithms are based on the observation that both the MCP and the CSCP admit near-optimal solutions 
that are laminar. Specifically, there is a polynomial-time algorithm that given a fractional feasible solution 
to MCP or CSCP (i.e. a feasible solution to IMCP-LPI ) produces a laminar family of fractional cuts that is 
feasible for the respective problem and has small load. This is formalized in Lemmas [T] and [2] below. We 
first introduce the notion of a fractional laminar family of cuts. 

Definition 1 A fractional laminar cut family C for terminal set T with weight function w is a collection of 
cuts with the following properties: 

• The collection is laminar 

• Each cut C in the family is associated with a unique terminal in T. We use Ci to denote the sub- 
collection of sets associated with terminal i & T. Every C ^ Ci contains the node ri. 

• For all i G T, the total weight of cuts in Ci, ^(^g^. w{C), is 1. 

Next we define what it means for a fractional laminar family to be feasible for the MCP or the CSCP. 
Note that for a terminal pair i ^ j belonging to the same commodity, condition (2) below is weaker than 
requiring cuts in both Ci and Cj to separate rj from rj. 

Definition 2 A fractional laminar family of cuts Cfor terminal set T with weight function w is feasible for 
the MCP on a graph G with edge capacities Cg and commodities Si, - ■ ■ ,Skif(l)T = UagjfcjS'a, (2) for all 
a € [k] and i,j ^ Sa, i ^ j, either Vj UceCiC'. or ri UceCjC, and (3) for every edge e £ E, i'^ < Cg. 

The family is feasible for the CSCP on a graph G with edge capacities Cg and commodities Si, - ■ ■ ,Sk 
if(l) T = Uae[k]Sa \ {t}. (2) t UcecC, and (3) for every e £ E, < Ce- 

Lemma 1 Consider an instance of the CSCP with graph G = (V, E), common sink t, edge capacities 
Ce, and commodities Si, - - - , Sk- Given a feasible solution d to IMCP-LPI algorithm Lam-1 produces a 
fractional laminar cut family C that is feasible for the CSCP on G with edge capacities Ce + o(l). 
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Lemma 2 Consider an instance of the MCP with graph G = {V, E), edge capacities Ce, and commodities 
Si, - ■ ■ , Sk- Given a feasible solution d to IMCP-LPI algorithm Lam-2 produces a fractional laminar cut 
family C that is feasible for the MCP on G with edge capacities Scg + o(l). 

Lemmas [T] and |2] are proven in Section H] In Section [3] we show how to deterministically round a 
fractional laminar solution to the CSCP and MCP into an integral one while increasing the load on every 
edge by no more than a small additive amount. These rounding algorithms aie the main contributions of our 
work, and crucially use the laminarity of the fractional solution. 

Lemma 3 Given a fractional laminar cut family C feasible for the CSCP on a graph G with integral edge 
capacities Cg, the algorithm Round-1 produces an integral family of cuts A that is feasible for the CSCP on 
G with edge capacities Ce + 1. 

For the MCP, the rounding algorithm loses an additive factor of 3 in edge load. 

Lemma 4 Given a fractional laminar cut family C feasible for the MCP on a graph G with integral edge 
capacities Ce, the algorithm Round-2 produces an integral family of cuts A that is feasible for the MCP on 
G with edge capacities Cg + 3. 

Combining these lemmas together we obtain the following theorem. 

Theorem 5 There exists a polynomial-time algorithm that given an instance of the MCP with graph G = 
(y, E), edge capacities Cg, and commodities 5i, • • • , Sk, produces a family A ofmultiway cuts, one for each 
commodity, such that for each e G E, < Scg + 4. 

There exists a polynomial-time algorithm that given an instance of the CSCP with graph G = (V, E), 
edge capacities Cg, and commodities Si, - ■ ■ , S^, produces a family A ofmultiway cuts, one for each com- 
modity, such that for each e G £^ < Cg + 2. 

3 Rounding fractional laminar cut families 

In this section we develop algorithms for rounding feasible fractional laminar solutions to the MCP and the 
CSCP to integral ones while increasing edge loads by a small additive amount. We first demonstrate some 
key ideas behind the algorithm and the analysis for the CSCP, and then extend them to the more general case 
of multiway cuts. Throughout the section we assume that the edge capacities Cg are integral. 

3.1 The common sink case (proof of Lemma |3]) 

Our rounding algorithm for the CSCP rounds fractional cuts roughly in the order of innermost cuts first. The 
notion of an innermost terminal is defined with respect to the fractional solution. After each iteration we 
ensure that the remaining fractional solution continues to be feasible for the unassigned terminals and has 
small edge loads. We use C to denote the fractional laminar cut family that we start out with and A to denote 
the integral family that we construct. Recall that for an edge e € E, £^ denotes the load of the fractional cut 
family C on e, and i-^ denotes the load of the integral cut family A on e. We call the former the fractional 
load on the edge, and the latter its integral load. 

We now formalize what we mean by an "innermost" terminal. For every vertex v £ V, let ivT^, denote 
the set of cuts in C that contain v. The "depth" of a vertex v is the total weight of all cuts in i^^,: dy = 
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Input: Graph G ~ (V, E) with capacities Ce, terminals T with a fractional laminar cut family C, common sink t with 

t ^ UcecC. 

Output: A collection of cuts A, one for each terminal in T. 



1. Initialize T' — T, A — %, and M{v) = {v} for aWv ^V. Compute the depths of vertices and terminals. 

2. While there are terminals in T' do: 

(a) Let i be a terminal with the maximum depth in T' . Let Ai = M{ri). Add Ai to A and remove i from T'. 

(b) Let K ~ K^. . Remove cuts in X n Ci from K, d and C. While there exists a terminal j S T' with a cut 
C £ K n Cj, do the following: let w = w{C); remove C from K, Cj and C; remove cuts in Q" from d 
and add them to Cj (that is, these cuts are reassigned from terminal i to terminal j). 

(c) If there exists an edge e = {u, v) with £g — 0, merge the meta-nodes M{u) and AI (v) (we say that the 
edge e has been "contracted"). 

(d) Recompute the depths of vertices and terminals. 



Figure 1: Algorithm Round- 1 — Rounding algorithm for common-sink s-t cut packing 

J2c'eKy w{C). The depth of a terminal is defined as the depth of the vertex at which it resides. Terminals 
are picked in order of decreasing depth. 

Before we describe the algorithm we need some more notation. At any point during the algorithm we 
use Se to denote the set of cuts crossing an edge e. As the algorithm proceeds, the integral loads on edges 
increase while their fractional loads decrease. Whenever the fractional load of an edge becomes 0, we merge 
its end-points to form "meta-nodes". At any point of time, we use M{v) to denote the meta-node containing 
a node v ^V. 

Finally, for a set of fractional cuts L = {Li, • • • , L/} with Li C L2 C • • • C L/ and weight function w, 
we use to denote the subset of L containing the innermost cuts with weight exactly x. That is, let V be 
such that Yla<i' '^(^a) < X and '}2a<v ^(-^a) ^ X. Then is the set {Li, • • • , L;'} with weight function 
w' such that w'{La) = w{La) for a < I' and w'{Li') = x — Yla<i' '^i^a)- 

The algorithm Round-1 is given in Figure[I] At every step, the algorithm picks a terminal, say i, with the 
maximum depth and assigns an integral cut to it. This potentially frees up capacity used up by the fractional 
cuts of i, but may use up extra capacity on some edges that was previously occupied by fractional cuts 
belonging to other terminals. In order to avoid increasing edge loads, we reassign to terminals in the latter 
set, fractional cuts of i that have been freed up. 

Our analysis has two parts. Lemma |6] shows that the family C continues to remain feasible, that is it 
always satisfy the first two conditions in Definition |2] for the unassigned terminals. Lemma |7] analyzes the 
total load of the fractional and integral families as the algorithm progresses. 

Lemma 6 Throughout the algorithm, the cut family C is a fractional laminar family for terminals in T' with 
t UcecC. 

Proof: We prove this by induction over the iterations of the algorithm. The claim obviously holds at the 
beginning of the algorithm. Consider a step at which some terminal i is assigned an integral cut. The 
algorithm removes all the cuts in K = K}. from C. Some of these cuts belong to other terminals; those 
terminals are reassigned new cuts. Specifically, we first remove cuts in ETn Cj from the cut family. The total 
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weight of the remaining cuts in K as well as the total weight of those in Ci is equal at this time. Subsequently, 
we successively consider terminals j with a cut C G K n Cj, and let w = w{C). Then we remove C from 
the cut family, and reassign cuts of total weight w in Cf to j. Therefore, the total weight of cuts assigned to 
j remains 1. Furthermore, the newly reassigned cuts contain the cut C, and therefore the vertex Vj, but do 
not contain the sink t. Therefore, C continues to be a fractional laminar family for terminals in T' . ■ 

Lemma 7 At any point of time for every edge e E E, i'^ < Ce — 1 implies l-^ + < Ce, l"^ = Ce implies 
£g < 1, and i'^ = Ce + 1 implies = 0. Furthermore, for e = {u, v), £^ = Ce implies that either H Se 
or Ky n Se is empty. 

Proof: Let e = {u,v). We prove the lemma by induction over time. Note that in the beginning of the 
algorithm, we have for all edges £g < Cg and i-^ = 0, so the inequality + ^g < Cg holds. 

Let us now consider a single iteration of the algorithm and suppose that the integral load of the edge 
increases during this iteration. (If it doesn't increase, since £g only decreases over time, the claim continues 
to hold.) Let i be the commodity picked by the algorithm in this iteration, then M{ri) is the same as either 
M(u) or M{v). Without loss of generality assume that rj G M{u). Let a denote the total weight of cuts 
in Ku n Se and /3 denote the total weight of cuts in n Se prior to this iteration. Then, a + (3 = £g. 
Moreover, all cuts in C \ 5g either contain both or neither of u and v. So we can relate the depths of v and u 
in the following way: = du — a + j3. Since i is the terminal picked during this iteration, we must have 
du > dy, and therefore, a > (3. 

We analyze the final edge load depending on the value of a. Two cases arise: suppose first that a > 1. 
Then C Ku n Se, and the fractional weight of e reduces by exactly 1. On the other hand, the integral 
load on the edge increases by 1, and so the total load continues to be the same as before. On the other hand, 
if a < 1, then H S'g C K^, and all the cuts in n S'g get removed from Se in this iteration. Therefore 
the final fractional load is at most P < a < 1, and at the end of the iteration, Ku n S'g = 0. If £^ < Cg — 1, 
we immediately get that the total load on the edge is at most Cg. 

If i-^ = Cg, then prior to this iteration = Cg — 1, and so £g < 1 by the induction hypothesis. Then, as 
we argued above, a < ^g < 1 implies that the new fractional load on the edge is at most 1 and at the end of 
the iteration, K„ n S'g = 0. 

Finally, if £^ = Cg + 1, then prior to this iteration, i-^ = Cg and by the induction hypothesis, /3 is zero 
(as a > /3 and either Ku H 5g or Ky n 5g is empty). Along with the fact that a < I (by the inductive 
hypothesis), the final fractional load on the edge is /3 = 0. ■ 

The two lemmas together give us a proof of Lemma[3l We restate the lemma for completeness. 

Lemma 3 Given a fractional laminar cut family C feasible for the CSCP on a graph G with integral edge 
capacities Cg, the algorithm Round-1 produces an integral family of cuts A that is feasible for the CSCP on 
G with edge capacities Cg + 1. 

Proof: First note that for every i, Ai is set to be the meta-node of at some point during the algorithm, 
which is a subset of every cut in Ci at that point of time. Then Vi ^ Ai, and by Lemma [6l t ^ Ai. Second, 
for any edge e, its integral load £^ starts out at being and gradually increases by at most an additive 1 at 
every step, while its fractional load decreases. Once the fractional load of an edge becomes zero, both its 
end points belong to the same meta-node, and so the edge never gets loaded again. Therefore, by Lemma|7J 
the maximum integral load on any edge e is at most Cg + 1. ■ 
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3.2 The general case (proof of Lemma H]) 

As in the common-sink case, the rounding algorithm for the MCP proceeds by picking terminals according 
to an order suggested by the fractional solution and assigning the smallest cuts possible to them subject to 
the availability of capacity on the edges. In the algorithm Round-1, we reassign cuts among terminals at 
every iteration so as to maintain the feasibility of the remaining fractional solution. In the case of MCP, 
this is not sufficient — a simple reassignment of cuts as in the case of algorithm Round-1 may not ensure 
separation among terminals belonging to the same commodity. We use two ideas to overcome this difficulty: 
first, among terminals of equal depth, we use a different ordering to pick the next terminal to minimize the 
need for reassigning cuts; second, instead of reassigning cuts, we modify the existing fractional cuts for 
unassigned terminals so as to remain feasible while paying a small extra cost in edge load. 

We now define the "cut-inclusion" ordering over terminals. For every terminal z G T, let Oj denote the 
largest (outermost) cut in Cj, that is, VC G Cj, C C Oj. We say that terminal i dominates (or precedes) 
terminal j in the cut-inclusion ordering, written i >ci j, if Oi C Oj (if Oj = Oj we break ties arbitrarily 
but consistently). Cut-inclusion defines a partial order on terminals. Note that we can pre-process the cut 
family C by reassigning cuts among terminals, such that for all pairs of terminals i,j € T with i >ci j, 
and for all cuts Cj G Cj and Cj G Cj with ri,rj G Cj n Cj, we have Cj C Cj. We call this property 
the "inclusion invariant". Ensuring this invariant requires a straightforward pairwise reassignment of cuts 
among the terminals, and we omit the details. Note that following this reassignment, for every terminal i, 
the new outermost cut of i, Oi, is the same as or a subset of its original outermost cut. 

As the algorithm proceeds we modify the collection C as well as build up the collection A of integral 
cuts Ai for i £ T. For example, we may split a cut C into two cuts containing the same nodes as C and with 
weights summing to that of C. As cuts in C are modified, their ownership by terminals remains unchanged, 
and we therefore continue using the same notation for them. Furthermore, if for two cuts Ci and C2, we have 
(for example) Ci C C2 at the beginning of the algorithm, this relationship continues to hold throughout the 
algorithm. This implies that the inclusion invariant continues to hold throughout the algorithm. We ensure 
that throughout the execution of the algorithm the cut family C continues to be a fractional laminar family for 
terminals T'. At any point of time, the depth of a vertex or a terminal, as well as the cut-inclusion ordering 
is defined with respect to the current fractional family C. 

As before, let Se denote the set of cuts in C that cross e — Se = {C G C\e G S{C)}. Recall that Ky 
denotes the set of cuts in C containing the vertex v, and of these denotes the inner-most cuts with total 
weight exactly 1. 

The rounding algorithm is given in Figure [3l Roughly speaking, at every step, the algorithm picks a 
maximum depth terminal i and assigns the cut M (rj) to it (recall that M{ri) is the meta-node of the vertex 
Vi where terminal i resides). It "pays" for this cut using fractional cuts in K}.. Of course some of the cuts 
in K}. belong to other commodities, and need to be replaced with new fractional cuts. The cut-inclusion 
invariant ensures that these other commodities reside at meta-nodes other than M(rj), so we modify each 
cut in K}._ \ Ci by removing M(rj) from it (see Figure|2l). This process potentially increases the total loads 
on edges incident on M(rj) by small amounts, but on no other edges. Step[3c]of the algorithm deals with 
the case in which edges incident on M(rj) are already overloaded; In this case we avoid loading those edges 
further by assigning to i some subset of the meta-node M(rj). Lemmas [T2] and [T3] show that this case does 
not arise too often. 

For a terminal i and edge e, if at the time that i is picked in Step [3a] of the algorithm e is in 5{M{ri)), 
we say that i accesses e. If e G Ei, we say that i defaults on e, and if e is in 5{Ai) after this iteration, then 
we say that i loads e. 

During the course of the algorithm integral loads on edges increase, but fractional loads may increase or 
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- - - cuts in Kr^ \ Ci 




Figure 2: An iteration of algorithm Round-2 (Steps 3b & 3d) 



decrease. To study how these edge loads change during the course of the algorithm, we divide edges into five 
sets. Let X_i denote the set of edges with < Ce — 1 and > 0. For a G {0, 1}, let Xa denote the set of 
edges with £-f = Ce + a and > 0. Y denotes the set of edges with £^ > Cg + 2 and £g > 0, and Z denotes 
the set of edges with £g = 0. Every edge starts out with a zero integral load. As the algorithm proceeds, the 
edge goes through one or more of the XaS, may enter the set Y, and eventually ends up in the set Z. As for 
the CSCP, when an edge enters Z, we merge the end-points of the edge into a single meta-node. However, 
unlike for the CSCP, edges may get loaded even after entering Z. When an edge enters Y, we avoid loading 
it further (Step[3cl), and instead load some edges in Z. Nevertheless, we ensure that edges in Z are loaded 
no more than once. 

As before our analysis has two components. First we show (Lemma [8]) that the cuts produced by the al- 
gorithm are feasible. The following lemmas give the desired guarantees on the edges' final loads: Lemmas|9] 
and [To] analyze the loads of edges in Xa for a € {—1, 0, 1}; LemmafTT] analyzes edges in Y and Lemmas [T2l 
and [13] analyze edges in Z. We put everything together in the proof of Lemma H] at the end of this section. 

Lemma 8 For all i, ri ^ Ai CL Oi. 

Proof: Each cut Ai is set equal to the meta-node of ri at some stage of the algorithm. Therefore, rj G Ai for 
all i. Furthermore, at the time that i is assigned an integral cut, Ai C M(rj) C Oi. ■ 

Next we prove some facts about the fractional and integral loads as an edge goes through the sets Xa- 
The proofs of the following two lemmas are similar to that of Lemma [7] 

Lemma 9 At any point of time, for every edge e € X-i, i'^ + < Cg. 

Proof: We prove the claim by induction over time. Note that in the beginning of the algorithm, we have for 
all edges £g < Cg and = 0, so the inequality (.-^ + < Ce holds. 

Let us now consider a single iteration of the algorithm and suppose that the edge e remains in the set 
X_i after this step. There are three events that influence the load of the edge e = (n, v): (1) a terminal 
at some vertex in M{u) accesses e; (2) a terminal at M{v) accesses e; and, (3) a terminal at some other 
meta-node M ^ M{u),M{v) is assigned an integral cut. Let us consider the third case first, and suppose 
that a terminal i is assigned. Since Ai C M and therefore e ^ S{Ai) its integral load does not increase. 
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Input: Graph G — (V, E) with capacities Cg on edges, a set of terminals T with a fractional laminar cut family C. 
Output: A collection of cuts A, one for each terminal in T. 



1 . Preprocess the family C so that it satisfies the inclusion invariant. 

2. Initiahze T' ^T, A = %,Y,Z = %, and M{v) = {v} for all v £V. 

3. While there are terminals in T' do: 

(a) Consider the set of unassigned terminals with the maximum depth, and of these let i G T' be a terminal 
that is undominated in the cut inclusion ordering. Let Ei — Y D d{M{ri)). 

(b) If^, = 0,letA, = A/(r,). 

(c) If Ei ^ (we say that the terminal has "defaulted" on edges in Ei), let Ui denote the set of end-points of 
edges in Ei that lie in M (ri). If ri G Ui, abort and return error. Otherwise, consider the vertex in Ui that 
entered M{ri) first during the algorithm's execution, call this vertex Ui. Set Ai to be the meta-node of 
just prior to the iteration where M{ui) becomes equal to M [ri). 

(d) Add A, to A Remove from C and i from T'. Foreveryj e T'andC G nCj, let C = C\{Af(r,)}. 

(e) If for some edge e, — + 2 and > 0, add e to Y. If there exists an edge e — {u, v) with = 0, 
merge the meta-nodes M (u) and M{v) (we say that the edge e has been "contracted".) Add all edges e 
with £g = to Z and remove them from Y. 

(f) Recompute the depths of vertices and terminals. 



Figure 3: Algorithm Round-2 — Rounding algorithm for multiway cut packing 

However, in the event that Se H Cj is non-empty, the fractional load on e may decrease (because cuts in Ci 
are removed from C). Therefore, the inequality continues to hold. 

Next we consider the case where a terminal, say i, with G M{u) accesses e (the second case is 
similar). Note that M{ri) = M{u). In this case the integral load of the edge e potentially increases by 1 (if 
the terminal loads the edge). By the definition of the new integral load on this edge is no more than 
Ce — 1. The fractional load on e changes in three ways: 

• Cuts in Ci n Se are removed from C, decreasing £f . 

• Some of the cuts in [Kl. \ Ci) \ Se get "shifted" on to e increasing (we remove the meta-node 
M{ri) from these cuts, and they may continue to contain M{v)). 

• Cuts in {K}. \ Ci)r\Se get shifted off from e decreasing (these cuts initially contain M{ri) but not 
M{v), and during this step we remove M{ri) from these cuts). 

So the decrease in £^ is at least the total weight of K^, n Se = n Se, whereas the increase is at most the 
total weight of \Se = Ki\Se. 

In order to account for the two terms, let a denote the total weight of cuts in n Se, and (3 denote the 
total weight of cuts in n Se- Then, q + /3 = £f . As in the proof of Lemma|7j we have = du — a + (3, 
and therefore du > di, implies a > (3. Now, suppose that a > 1. Then K}^ C Se- Therefore, the decrease 
in due to the sets iC^ n 5e = K}^ is at least 1, and there is no corresponding increase, so the sum l-^ + 
remains at most Cg. 
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Finally, suppose that a < 1. Then K^^ contains all the cuts in H Se, the weight of K^^ n Se is exactly 
a, and so the decrease in is at least a. Moreover, the total weight of \ S'e is 1 — a, therefore, the 
increase in £g due to the sets in \ Se is at most 1 — a. Since £^ starts out as being equal to a + (3, its 
final value after this step isl — a + /3<las/3<a. Noting that £^ is at most Cg — 1 after the step, we get 
the desired inequality. ■ 

Lemma 10 For any edge e = {u, v), from the time that e enters Xq to the time that it exits Xi, < 1. 
Furthermore suppose (without loss of generality) that during this time in some iteration e is accessed by 
a terminal i with ri € M{u), then following this iteration until the next time that e is accessed, we have 
Se n Ku = 0, and the next access to e (if any) is from a terminal in M{v). 

Proof: First we note that if the lemma holds the first time an edge e = (n, v) enters a set Xa, a € {0, 1}, then 
it continues to hold while the edge remains in Xa- This is because during this time the integral load on the 
edge does not increase, and therefore throughout this time we assign integral cuts to terminals at meta-nodes 
different from M{u) and M{v) — this only reduces the fractional load on the edge e and shrinks the set Se- 

Consider the first time that an edge e = (n, v) moves from the set X_i to Xq. Suppose that at this step 
we assign an integral cut to a terminal i residing at node G M{u). Prior to this step, l-^ = Ce — 1, and so 
by Lemma|9j £f < 1. As before define a to be the total weight of cuts Ku H Se, and (5 to be the total weight 
of cuts Ky n Se- Then following the same argument as in the proof of Lemma |9l we conclude that the final 
fractional weight on e is at most /3 + 1 — a < 1. Furthermore, since Ku ^ Se Q K^, we either remove all 
these cuts from C or shift them off of edge e. Moreover, any new cuts that we shift on to e do not contain 
the meta-node M(rj) = M{u), and in particular do not contain the vertex n. Therefore at the end of this 
step, Se n Ku = 0. This also implies that following this iteration terminals in M{v) have depth larger than 
terminals in M{u), and so the next access to e must be from a terminal in M{v)- 

The same argument works when an edge moves from Xq to Xi . We again make use of the fact that prior 
to the step the fractional load on the edge is at most 1. ■ 

Lemma 11 During any iteration of the algorithm, for any edge e ^Y, the following are satisfied: 

• fe<l 

• If the edge e = (u, v) is accessed by a terminal i with € M{u), then following this iteration until 
the next time that e is accessed, we have Se H Ku = 0, and the next access to e (if any) is from a 
terminal in M{v)- 

• If a terminal i with € M{u) accesses e = {u,v), then ^ u, Ai (1 {u,v} = 0, and so i does 
not load e. Also, consider any previous access to the edge by a terminal in M{u); then prior to this 
access, rj M(n). 

Proof: The first two parts of this lemma extend Lemma[TO]to the case of e € y, and are otherwise identical 
to that lemma. The proof for these claims is analogous to the proof of Lemma [lOl The only difference is 
that terminals accessing an edge e € 1" default on this edge. However, this does not affect the argument: 
when a terminal defaults on the edge, the edge's fractional load changes in the same way as if the terminal 
did not default; the only change is in the way an integral cut is assigned to the terminal. Since these claims 
depend only on how the fractional load on the edge changes, they continue to hold while the edge is in Y- 



11 



For the third part of the lemma, since Ai C M{ri) = M{u) and v M{u), v ^ Ai. Next we show 
that u ^ Ai. Consider the iterations of the algorithm during which < 1. During this time the edge was 
accessed at least twice prior to being accessed by i (once when e moved from Xq to Xi , once when e moved 
from Xi to Y, and possibly multiple times while e G Y). Let the last two accesses be by the terminals ji 
and j2, at iterations ti and t2, ti < ^2- For a G {0, 1}, let M"(n) and M°-{v) denote the meta-nodes of u 
and V respectively just prior to iteration ta, and M(n) and M{v) denote the respective meta-nodes just prior 
to the current iteration. Then by Lemma [lO] and the second part of this lemma, we have rj-^ G M^(n) and 

G M'^{v). We claim that i >ci 32 >ci Ji- Given this claim, if r-i G M^{u) = M^(rjJ, then since 
i and ji have the same depth at iteration ti, we get a contradiction to the fact that the algorithm picks ji 
before i in Step|3al Therefore, r, ^ M(n) at any iteration prior to ti, and in particular, 7^ u. Finally, 
since u G Ui and J7j n = 0, this also implies that u ^ Ai. 

It remains to prove the claim. We will prove that j2 >ci ji- The proof for i >ci 32 is analogous. In 
fact we will prove a stronger statement: between iterations ti and t2, all terminals with cuts in dominate 
ji in the cut-inclusion ordering. We prove this by induction. By Lemma [TOl prior to iteration ti, Se does 
not contain any cuts belonging to terminals at M{v). Following the iteration, Se only contains fractional 
cuts in K}^ that got shifted on to the edge e. Prior to shifting, these cuts contain M^{u), and therefore r^^, 
but do not belong to ji . Then, these cuts are subsets of Oj^ , and so by the inclusion invariant, they belong 
to terminals dominating ji in the cut-inclusion ordering. Therefore, the claim holds right after the iteration 
ti. Finally, following the iteration until the next time that e is accessed (by 32), the set Se only shrinks, and 
so the claim continues to hold. ■ 

In order to analyze the loading of edges in Z, we need some more notation. Let M. denote the collection 
of sets of vertices that were meta-nodes at some point during the algorithm. For any edge e G let Me 
denote the meta-node formed when e enters Z; then Me is the smallest set in M. containing both the end 
points of e. Note that the collection A^J M.i?> laminar. 

Lemma 12 An edge e ^ Z is loaded only if after the formation of Me a terminal residing at a vertex 
in Me defaults on an edge in 5{Me). (Note that this may happen after Me has merged with some other 
meta-nodes.) 

Proof: Let i be a defaulting terminal that loads the edge e G Z. Then e G S{Ai), and therefore, Ai C Me 
and Tj G Me. Furthermore, since Ai is a strict subset of Me, Ui n Me / 0, and therefore, i defaults on an 
edge e' G y with at least one end-point in Mg. But if both the end-points of e' are in Me, then we must have 
= contradicting the fact that e' is in Y. Therefore, e' G 5{Me). ■ 



Lemma 13 For any meta-node M G ^A, after its formation, at most one terminal residing at a vertex in M 
can default on edges in S{M) (even after M has merged with other meta-nodes). 

Proof: For the sake of contradiction, suppose that two terminals i and j, both residing at vertices in M default 
on edges in d{M) after the formation of M, with i defaulting before j. Let Mi (M2) denote the meta-node 
containing M just before i (j) defaulted. Note that M C Mi C M2. Consider an edge e E Ej n 6{M) 
(recall that Ej is the set of edges that j defaults on, so this set is non-empty by our assumption). Then 
e G 6{M) n 6{M2) C 6{Mi). Therefore, at the time that i defaulted, e was accessed by i, and by the third 
claim in Lemma [TT] rj Mi. This contradicts the fact that rj G M. ■ 

Finally we can put all these lemmas together to prove our main result on algorithm Round-2. 
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Input: Graph G ~ (V, E) with edge capacities Ce, commodities Si, - ■ ■ , Sk, common sink t, a feasible solution d to 
the program IMCP-LPI 

Output: A fractional laminar family of cuts C that is feasible for G with edge capacities Ce + o(l). 



1. For every a e [k] and terminal i ^ Sa do the following: Order the vertices in G in increasing order of 
their distance under da from 7-;. Let this ordering be = fj, i^i, ■ ' ■ j ^n- Let Ci be the collection of cuts 
{vo,vi,--- ,Vb}, one for each b G [n], da{ri,Vb) < 1, with weights w{{vq,--- ,Vb}) = da{ri,Vb+i) - 
dai^i, Vb). Let C denote the collection {Ci}igUaSa- 

2. Let N = nfc. Round up the weights of all the cuts in C to multiples of 1 /N'^, and truncate the collection so that 
the total weight of every sub-collection Ci is exactly 1. Also split every cut with weight more than 1 /iV^ into 
multiple cuts of weight exactly 1 /N^ each, assigned to the same commodity. 

3. While there are pairs of cuts in C that cross, consider any pair of cuts Ci, Cj G C belonging to terminals i j 
that cross each other. Transform these cuts into new cuts for i and j according to Figure|5] 



Figure 4: Algorithm Lam-1 — Algorithm to convert an LP solution for the CSCP into a feasible fractional 
laminar family 



Lemma 4 Given a fractional laminar cut family C feasible for the MCP on a graph G with integral edge 
capacities Cg, the algorithm Round-2 produces an integral family of cuts A that is feasible for the MCP on 
G with edge capacities + 3. 

Proof: We first note that the third part of LemmafTTlimplies that for all i, C/j, and therefore the algorithm 
never aborts. Then Lemma [8] implies that we get a feasible cut packing. Finally, note that every edge starts 
out in the set goes through one or more of the X^'s, a € {0, 1}, potentially goes through Y, and ends 
up in Z. An edge e enters Y when its integral load becomes Cg + 2. Lemma [TT] implies that edges in Y 
never get loaded, and so at the time that an edge e enters Z, < Ce + 2. After this point the edge stays in 
Z, and Lemmas [12] and [T3] imply that it gets loaded at most once. Therefore, the final load on the edge is at 
most Ce + 3. ■ 



4 Constructing fractional laminar cut packings 

We now show that fractional solutions to the program IMCP-LPl can be converted in polynomial time into 
fractional laminar cut families while losing only a small factor in edge load. We begin with the common 
sink case. 

4.1 Obtaining laminarity in the common sink case 

We prove Lemma [T] in this section. Our algorithm involves starting with a solution to IMCP-LPI converting 
it into a feasible fractional non-laminar family of cuts, and then resolving pairs of crossing cuts one at a 
time by applying the rules in Figure |5] The algorithm is given in Figure |4] 
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Lemma 1 Consider an instance of the CSCP with graph G = (V, E), common sink t, edge capacities 
Ce, and commodities Si, - ■ ■ , Sk- Given a feasible solution d to IMCP-LP] algorithm Lam- 1 produces in 
polynomial time a fractional laminar cut family C that is feasible for the CSCP on G with edge capacities 

Ce + o(l). 

Proof: We first note that the family C is feasible for the given instance of CSCP at the end of Step[2l but is 
not necessarily laminar. Since the number of distinct cuts in C after Step[T]is at most nk = N, at the end of 
Step 121 edge loads are at most Cg + As we tranform the cuts in Step[3l we maintain the property that 
no cut C £ C contains the sink t, but every cut C £ Ci contains the node rj for terminal i. It is also easy 
to see from Figure [5] that the load on every edge stays the same. Finally, in every iteration of this step, the 
number of pairs of crossing cuts strictly decreases. Therefore, the algorithm ends after a polynomial number 
of iterations. ■ 



4.2 Obtaining laminarity in the general case 

Obtaining laminarity in the general case involves a more careful selection and ordering of rules of the form 
given in Figure [5] The key complication in this case is that we must maintain separation of every terminal 
from every other terminal in its commodity set. We first show how to convert an integral collection of cuts 
feasible for the MCP into a feasible integral laminar collection of cuts. We lose a factor of 2 in edge loads 
in this process (see Lemma [141 below). Obtaining laminarity for an arbitrary fractional solution requires 
converting it first into an integral solution for a related cut-packing problem and then applying Lemma [141 
(see algorithm Lam-2 in Figure[8land the proof of Lemma |2l following it). 

Lemma 14 Consider an instance of the MCP with graph G = {V, E)and commodities Si, - ■ ■ , Sk, and let 
= {Gl}i^g^ae[k] a family of cuts such that for each a € [k] and i E Sa, Cj contains i but no other 
j G Sa- Then algorithm Integer-Lam-2 produces a laminar cut collection = {Cf}i^s^^a&[k] such that for 
each a G [k] and i ^ j £ Sa, either Gf or Cj separates ifrom j, and i'^^ < for every edge e G E. 

In the remainder of this section we interpret cuts as sets of vertices as well as sets of terminals residing 
at those vertices. The algorithm for laminarity in the integral case is given in Figure [6l 
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Input: Graph G = {V, E) with edge capacities Ce, commodities 5*1, ■• • , Sk, a family of cuts C with one cut for every 
terminal in Ua5a, such that the cut for terminal i G Sa does not contain any terminal j ^ i in Sa- 
Output: A laminar collection of cuts, one for each terminal in UaSa, such that for all a and for all i, j G Sa, i ^ j, 
either the cut for i or the cut for j separates i from j. 



1. While there are pairs of cuts in C that cross, do (see Figure|7]l: 

(a) Consider any pair of cuts C,; ,Cj G C belonging to terminals i ^ j that cross each other, such that 
ri G Ci \ Cj and G Q \ Ci. Reassign d = Ci\ Cj and Cj = C-j \ Ci. Return to Step[T] 

(b) Consider any three terminals ii,i2,«3 with cuts Ci,C2 and C3 such that ri^ G Ci fl C2 \ C3, r^^ G 
C2 n C3 \ Ci, and r^., G C3 n Ci \ C2. Then, reassign these respective intersections to the three terminals. 
Return to Step[T] 

(c) Consider any pair of cuts Ci, Cj G C belonging to terminals i, j G Sa for some a that cross each other, 
such that ri G Ci n Cj and G C^ \ C^. Reassign Ci = Ci n C-j and Cj = Ci U Cj. Return to Step[T] 

(d) Consider any pair of cuts Ci , Cj G C belonging to terminals i j that cross each other, such that 

ri, rj G Ci n Cj, i & Sa and G S'b with a^h. 

• Suppose that there is no i' G SaC\Cj with Ci C Ci' . Then, reassign Ci = Ci U Cj and Cj = Ci n Cj ; 
return to Step[T] Conversely, if there is no j' G S*;, fl Ci with Cj C Cj' . Then, reassign Cj — Ci U Cj 
and Ci = Ci n Cj; return to Step[T| (This transformation is similar to Step [Tel) 

• If neither of those cases hold, let = i, and let ii, • • • , i^; denote the terminals in Sa H Cj with 

C, C C C^, C ■ • • C C,^. For x' < a; - 2, reassign Q^, = (Q^,^^ \ Cj) U C,^,, C,_, = 
Ci^ U Cj, and Ci^ = Ci^ n Cj \ Ci^_ ^ . Reassign cuts to j and terminals in Sh n Ci likewise. Return 
to Step[T] 

(e) If none of the above rules match, then go to Step|2] 

2. Let C/ be a directed graph on the vertex set \^aSa, with edges colored red or blue, defined as follows: for 
terminals i j, Q contains a red edge from i to j if and only if Cj C Ci, and contains a blue edge from i to j 
if and only if rj G Ci, ri ^ Cj, and Cj \Ci ^ 9. We note that since no pair of terminals i and j matches the 
rules in Stepfl] whenever Ci and Cj intersect Q contains an edge between i and j. 

While there is a directed blue cycle in Q, consider the shortest such cycle ii ^ i2 ^ ■ ■ ■ ^ ix ^ h- For 
x' < X, x' ^ 1, assign to i^' the cut Ci^, fl Ci^,_^, and assign to ii the cut Ci^ fl Ci^. 

3. We show in LemmafTSlthat at this step Q is acyclic. For every connected component in Q do: 

(a) Let T be the set of terminals in the component and A be the set of corresponding cuts. Assign capacities 
Pe — 2£f to edges in G. Let Gp be the graph obtained by merging all pairs of vertices that have an edge 
e with Pe — between them. We call the vertices of Gp "meta-nodes" (note that these are sets of vertices 
in the original graph). At any point of time, let Ri denote the meta-node at which a terminal i resides. 

(b) While there are terminals in T, pick any "leaf" terminal i (that is, a terminal with no outgoing red or blue 
edges in Q). Reassign to i the cut Ri. Reduce the capacity of every edge e G S{Ri) by 1. Remove i from 
T; remove i and all edges incident on it from Q. Recompute the graph Gp based on the new capacities. 



Figure 6: Algorithm Integer-Lam-2 — Algorithm to convert an integral family of multiway cuts into a laminar 
one 
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Figure 7: Some simple rules for resolving crossing cuts. See algorithm Integer-Lam-2 in Figure|6]for formal 
descriptions. 



As in the common sink case, the algorithm starts by applying a series of simple rules to pairs of crossing 
cuts while maintaining the invariant that pairs of terminals belonging to the same commodity are always 
separated by at least one of the two cuts assigned to them. Certain kinds of crossings of cuts are easy to 
resolve while maintaining this invariant (Step[T]of the algorithm resolves these crossings; see also Figure|7]l. 
In Steps [2] and [3l we ignore the commodities that each terminal belongs to, and assign new laminar cuts 
to terminals while ensuring that the new cut of each terminal lies within its previous cut (and therefore, 
separation continues to be maintained). These steps incur a penalty of 2 in edge loads. 

The rough idea behind Steps [2] and [3] is to consider the set of all "conflicting" terminals, call it F. 
Then we can assign to each terminal i ^ F the cut Cij^pCj where Cj is either the cut of terminal j or its 
complement depending on which of the two contains rj. These intersections are clearly laminar, and are 
subsets of the original cuts assigned to terminals. Furthermore, if each terminal gets a unique intersection, 
then edge loads increase by a factor of at most 2. Unfortunately, some groups of terminals may share the 
same intersections. In order to get around this, we assign cuts to terminals in a particular order suggested by 
the structure of the conflict graph on terminals (graph Q in the algorithm) and assign appropriate intersections 
to them while explicitly ensuring that edge loads increase by a factor of no more than 2. 

Throughout the algorithm, every terminal in UaSa has an integral cut assigned to it. The proof of 
Lemma [T4l is established in three parts: Lemma [TS] establishes the laminarity of the output cut family, 
Lemma [TT] argues separation, and Lemma [TSl analyzes edge loads. 

Lemma 15 Algorithm Integer-Lam-2 runs in polynomial time and produces a laminar cut collection. 

Proof: As in the previous section define the crossing number of a family of cuts to be the number of pairs 
of cuts that cross each other. We first note that in every iteration of Steps [U and |2] of the algorithm, the 
crossing number of the cut family C strictly decreases: no new crossings are created in these steps, while the 
crossings of the two or more cuts involved in each transformation are resolved (see Figure |7]l. Therefore, 
after a polynomial number of steps, we exit Steps [T] and [2] and go to Step [3] 

Next, we claim that during Step [3] of the algorithm the graph Q is acyclic. This implies that while Q is 
non-empty, we can always find a leaf terminal in Step [3l therefore every terminal in Q gets assigned a new 
cut. It is immediate that the graph does not contain any directed blue cycles or any directed red cycles (the 
latter follows because red edges define a partial order over terminals). Suppose the graph contains three 
terminals ii, 12 and ^3 with a red edge from ii to ^2, and a red or blue edge from 12 to ^3, then it is easy to see 
that there must be a red or blue edge from ii to 23. Therefore, any multi-colored directed cycle must reduce 
to either a smaller blue cycle or a cycle of length 2. Neither of these cases is possible (the latter is ruled out 
by definition), and therefore the graph cannot contain any multi-colored cycles. 
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Now consider cuts assigned during Step [3] Let T be the set of terminals corresponding to some compo- 
nent in Q and j ^ T. Then before T is processed, j's cut is laminar with respect to all the cuts in At, and is 
therefore a subset of some meta-node in GpT. So the new cuts assigned to terminals in T are also laminar 
with respect to j's cut. 

Finally, consider any two cuts assigned during Step |3] of the algorithm and belonging to two terminals 
in the same component of Q. Consider the set of all meta-nodes created during this iteration of Step|3] This 
set is laminar, and the cuts assigned during this iteration are a subset of this laminar family. Therefore, they 
are laminar. ■ 



Lemma 16 For a commodity i assigned a cut in Step^of algorithm Integer-Lam-2, let C} be its cut before 
this step, and Cf be the new cut assigned to it Then Cf C Cf. 

Proof: We assume without loss of generality that prior to Step |3] each edge load is at most one; this can be 
achieved by splitting a multiply-loaded edge into many edges. We focus on the behavior of the algorithm 
for a single component T of G and prove the lemma by induction over time. 

Consider an iteration of Step [3b] during which some terminal i e T is assigned and let Cj be its original 
cut. Consider any vertex v Ci and let P be a shortest simple path from Vi to v in GpT (where the length 
of an edge e is given by just prior to when i is assigned a new cut). It is easy to see that there is one such 
shortest path that crosses each new cut assigned prior to this iteration in Step |3b] at most twice - suppose 
there are multiple entries and exits for some cut, we can "short-cut" the path by connecting the first point on 
the path inside the cut to the last point on the path inside the cut via a simple path of length lying entirely 
inside the cut. We pick P to be such a path. We will prove that P's length is at least 2. So the meta-node 
containing i must lie inside the cut Ci, and the lemma holds. 

Let Ti (resp. T2) be the set of terminals in T \ Cj (resp. T n Cj) that are assigned new cuts before i in 
this iteration. We first note that for any j in Ti, prior to this step, there is no edge from j to i (as j is assigned 
before i), so Cj, and this along with rj Ci implies that Ci and Cj are disjoint. This implies that the 
new cut of j (which is a subset of Cj by induction) is also disjoint from Cj, and therefore cannot load any 
edge with an end-point in Ci. So the only new cuts assigned this far in Step[3b]that load edges in P belong 
to terminals in T2. 

Now we will analyze P's length by accounting for all the newly assigned cuts that load its edges. Let 
Sp be the set of terminals in T2 that load an edge in P, and j e Sp. Since the new cut of j intersects P, by 
the induction hypothesis, Cj should either intersect P or contain the entire path inside it. If Cj contains P 
entirely, then Cj \Ci ^ %, and furthermore r^, rj € Cj fl Cj. This implies that either Ci C Cj and there is 
a directed red edge from j to i, or Cj \ Cj 7^ 0, that is, Cj and Cj cross and should have matched the rule in 
Step[Td]of the algorithm. Both possibilities lead to a contradiction. Therefore, Cj must intersect P. 

Finally, the original total length of the path is at least 2\Sp\ + 2, because each terminal in Sp contributes 
two units towards its length, and another two units is contributed by Cj. Out of these up to 2\Sp\ units of 
length is consumed by terminals in Sp. Therefore, at the time that i is assigned a cut, at least 2 units remain. 



Lemma 17 When algorithm Integer-Lam-2 terminates, for every a G [k] and i ^ j Sa, either Ci or Cj 
separates ifrom j. 

Proof: We claim that for every a G [k] and i ^ j £ Sa, at every time step during the execution of the 
algorithm, |Cj fl Cj n {ri,rj}\ < 1. Then since by Lemma [T5] the final solution is laminar, the lemma 
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follows. We prove this claim by induction over time. First, if during any iteration of the algorithm, we 
"shrink" the cut of any terminal (that is, reassign to the terminal a cut that is a strict subset of its original 
cut), then the claim continues to hold for that terminal, because intersections of the terminal's cut only shrink 
in that step. Note that cuts of terminals expand only in Steps [Tc] and [Id] of the algorithm (by construction 
and by Lemma [T6l). 

Suppose that during some iteration we apply the transformation in Step[Tc]to terminals i and j, reassign- 
ing Cj = Cj U Cj, and the claim fails to hold for terminal j. Specifically, suppose that for some / S Sa, 
after the iteration we have rj,rj/ e Cj n Cj'. Then, rj G Cj/, and therefore Cj/ intersected Cj prior to 
the iteration, and by the induction hypothesis rj/ G Cj \ Cj prior to the iteration. If rj € Cj', then prior 
to the iteration, i and / contradicted the induction hypothesis. Otherwise, i, j and j' satisfy the conditions 
in Step[Tb]of the algorithm, and this contradicts the fact that we apply the transformation in Step[Tc]at this 
iteration. 

Next suppose that during some iteration we apply the transformation in the first part of Step [id] to 
terminals i and j, reassigning Cj = Cj U Cj, and the claim fails to hold for terminal j; in particular, for 
some j' e Sa, after the iteration we have rj,rj/ G Cj fl Cji. Then, since rj € Cj' and the pair of terminals 
did not match the criteria in Step[lcJ it must be the case that Cj C Cj' prior to the iteration. Furthermore, 
rj/ G Cj prior to the iteration and this contradicts the fact that we applied the transformation in the first part 
of Step [HI 

Finally, suppose that during some iteration we apply the transformation in the second part of Step [Tdl 
Then the cut assigned to every i^' for x' < x — 2 is a subset of the previous cut of i^'+i, but does not contain 
the latter terminal, and so by the arguments presented for the previous cases, once again the induction 
hypothesis continues to hold for those terminals. Furthermore, the cut assigned to is a subset of its 
original cut and i^-i does not belong to any of the new cuts except its own. The same argument holds for 
the jy' terminals. ■ 



Lemma 18 For the cut collection produced by algorithm Integer-Lam-2 the load on every edge is no more 
than twice the load of the integral family of cuts input to the algorithm. 

Proof: We first claim that edge loads are preserved throughout Steps [T] and |2] of the algorithm. This can 
be established via a case-by-case analysis by noting that in every transformation of these steps, the number 
of new cuts that an edge crosses is no more than the number of old cuts that the edge crosses prior to the 
transformation. It remains to analyze Step [3] of the algorithm. We claim that we only lose a factor of 2 in 
edge loads during this step of the algorithm. This is easy to see. Note that for every edge e, 'YlixPe ^ 2£gUT^ 
where Cut is the family of cuts belonging to terminals in any non-singleton component of Q prior to Step [3] 
Moreover, in each iteration of the step, we only load an edge e to the extent of pj. Therefore the lemma 
follows. ■ 

Proof of Lemma \T4\ The proof follows immediately from Lemmas [T5l [TT] and [TSl ■ 

Given this lemma, algorithm Lam-2 in Figure [8] converts an arbitrary feasible solution for lMCP-LPI into 
a feasible fractional laminar family. 



Lemma 2 Consider an instance of the MCP with graph G = {V, E), edge capacities Cg, and commodities 
Si, - ■ ■ , Sk- Given a feasible solution d to IMCP-LPI algorithm Lam-2 produces a fractional laminar cut 
family C that is feasible for the MCP on G with edge capacities 8ce + o(l). 
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Input: Graph G ~ (V, £') with edge capacities Cg, commodities S*!,--- ,Sk, a feasible solution d to the pro- 
gram lMCP-LPl 

Output: A fractional laminar family of cuts C that is feasible for G with edge capacities 8ce + o(l). 



1. For every a e [k] and every terminal i G 5a do the following: Order the vertices in G in increasing or- 
der of their distance under da from r^. Let this ordering be = ri,i;i, • ■ • ,i;„. Let C] be the collection 
of cuts {wqjUi,--- one for each b G [n] with da{ri,Vb) < 0.5, with weights w^dvo,--- ,Vb}) = 
2{mm{da{ri,Vb+i),Q.5} - da{ri,Vb)). LetC^ denote the collection {Cjji^u^Sa- 

2. Let N = n \'^a\- Round up the weights of all the cuts in to multiples of 1 /N'^, and truncate the collection 
so that the total weight of every sub-collection is exactly 1. Furthermore, split every cut with weight more 
than into multiple cuts of weight exactly each, assigned to the same commodity. Call this new 
collection with weight function w'^. Note that every cut in this collection has weight exactly 

3. Construct a new instance of MCP in the same graph G as follows. For each a G [k], construct new 
commodities with terminal sets identical to that of Sa (that is the terminals reside at the same nodes). For every 
new terminal corresponding to an older terminal i, assign to the new terminal a unique cut from Cf with weight 
1. Call this new collection C^, and the new instance /. 

4. Apply algorithm Integer-Lam-2 from Figure|6]to the family to obtain family C^. 

5. For every a G \k\ and every i ^ Sa, let Cf be the set of innermost cuts in assigned to terminals in the 
new instance / that correspond to terminal i. (Note that these cuts are concentric as they belong to a laminar 
family and all contain r^. Therefore "innermost" cuts are well defined.) Assign a weight of to every cut 
in this set. Output the collection C^. 



Figure 8: Algorithm Lam-2 — Algorithm to convert an LP solution into a feasible fractional laminar family 

Proof: Note first that the cut collection satisfies the following properties: (1) For every a G [k] and i ^ Sa, 
every cut in Cj contains r^, but not rj for j € Sa, j 7^ i', (2) The total weight of cuts in is 1; (3) For every 
edge e, (.^^ < 2X]^(ia(e) < 2ce. The family also satisfies the first two properties, however loads the 
edges slightly more than C^. Any edge belongs to at most N cuts, and therefore the load on the edge goes up 
by an additive amount of at most 1/A^. Therefore, for every e, £f < 2ce A-\/N. Next, the collection is 
a feasible integral family of cuts for the new instance / with £^ = N'^l^ . Therefore, applying Lemma [141 
we get that is a feasible laminar integral family of cuts for / with < 2A^^(2ce + 1/-/V)- Finally, in 
family C^, every terminal i £ Sa gets assigned fractional cuts, each with weight 2/N'^. Therefore, 

the total weight of cuts in Cf is 1. Now consider any two terminals i,j € Sa with i j. Then, in all the N"^ 
commodities corresponding to 5a in instance /, either the cut assigned to i's counterpart, or that assigned to 
j's counterpart separates i from j. Say that among at least A^^/2 of the commodities in I', the cut assigned 
to i's counterpart separates i from j. Then, the innermost cuts assigned to i in separate i from j. 

Therefore, the family satisfies the first two conditions of feasibility as given in Definition [2l Finally, it is 
easy to see that on every edge e, if < 2/N'^ff < 4(2ce + l/N). ■ 
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5 NP-Hardness 



We will now prove that CSCP and MCP are NP-hard. Since edge loads for any feasible solution to these 
problems are integral, the result of Theorem [5] is optimal for the CSCP assuming P/NP. The reduction in 
this theorem also gives us an integrality gap instance for the CSCP. 

Theorem 19 CSCP and MCP are NP-hard. Furthermore the integrality gap of lMCP-LPI is at least Ifor 
both the problems. 

Proof: We reduce independent set to CSCP In particular, given a graph G and a target k, we produce an 
instance of CSCP such that the load on every edge is at most 1 if and only if G contains an independent set 
of size at least k. Let n be the number of vertices in G. We construct G' by adding a chain ofn — k + l new 
vertices to G. Let the first vertex in this chain be t (the common sink) and the last be v. We connect every 
vertex of G to the new vertex v, and place a terminal i at every vertex ri in G (therefore, there are a total of 
n sources). We claim that there is a collection of n edge-disjoint — t cuts in this new graph G' if and only 
if G contains an independent set of size k. 

One direction of the proof is straightforward: if G contains an independent set of size k, say S, then 
for each vertex G S, consider the cut {rj}, and for each of the n — k source not in S, consider the cuts 
obtained by removing one of the n — k chain edges in G'. Then all of these n cuts are edge-disjoint. 

Next suppose that G' contains a collection of edge-disjoint cuts Q, with rj € Q and t ^ Gi for all i. 
Note that the number of cuts Cj containing any chain vertex is at most n — k because each of them cuts at 
least one chain edge. Next consider the cuts that do not contain any chain vertex, specifically v, and let T' 
be the collection of terminals for such cuts. These are at least k in number. Note that any cut Ci, i G T', 
cuts the edges {u, v) for u € Gi. Therefore, in order for these cuts to be edge-disjoint, it must be the case 
that Ci n Cj = for i,j G T', i ^ j. Finally, for two such cuts Cj and Cj, edge-disjointness again implies 
that and rj are not connected. Therefore the vertices for i ^ T' form an independent set in G of size at 
least k. 

For the integrality gap, let G be the complete graph and khe n/2. Then, there is no integral solution 
with load 1 in G'. However, the following fractional solution is feasible and has a load of 1: let the chain 
of vertices added to G be v = vi,V2, ■ ■ ■ , Wn/2+1 = assign to every terminal i, i £ [n], the cut {rj} with 
weight 1/2, and the cut V U {vq, ■ ■ ■ ,v^i/2\} with weight 1/2. ■ 



6 Concluding Remarks 

Given that our algorithms rely heavily on the existence of good laminar solutions, a natural question is 
whether every feasible solution to the MCP can be converted into a laminar one with the same load. Figure |9] 
shows that this is not true. The figure displays one integral solution to the MCP where the solid edges 
represent the cut for commodity a, and the dotted edges represent the cut for commodity b. It is easy to see 
that this instance admits no fractional laminar solution with load 1 on every edge. 

Is the "laminarity gap" small for the more general set multiway cut packing and multicut packing prob- 
lems as well? We believe that this is not the case and there exist instances for both of those problems with a 
non-constant laminarity gap. 
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Figure 9: Each edge has capacity 1. There are two commodities with terminal sets {00,01,02} and 
{bo,bi,b2}. 
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